Genome Majority Vote Improves Gene Predictions

نویسندگان

  • Michael E. Wall
  • Sindhu Raghavan
  • Judith D. Cohn
  • John Dunbar
چکیده

Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vote Trading With and Without Party Leaders

Two groups of voters of known sizes disagree over a single binary decision to be taken by simple majority. Individuals have different, privately observed intensities of preferences and before voting can buy or sell votes among themselves for money. We study, theoretically and experimentally, the implication of such trading for outcomes and welfare when trades are coordinated by the two group le...

متن کامل

Momresp: A Bayesian Model for Multi-Annotator Document Labeling

Data annotation in modern practice often involves multiple, imperfect human annotators. Multiple annotations can be used to infer estimates of the ground-truth labels and to estimate individual annotator error characteristics (or reliability). We introduce MOMRESP, a model that improves upon item response models to incorporate information from both natural data clusters as well as annotations f...

متن کامل

Forecasting Life and Death : Juror Race , Religion , and Attitude toward the Death Penalty

Determining whether race, sex, or other juror characteristics influence how capital case jurors vote is difficult. Jurors tend to vote for death in more egregious cases and for life in less egregious cases no matter what their own characteristics. And a juror's personal characteristics may get lost in the process of deliberation because the final verdict reflects the jury's will, not the indivi...

متن کامل

The Dark Side of the Vote : Biased Voters , Social

The Dark Side of the Vote: Biased Voters, Social Information, and Information Aggregation Through Majority Voting by Rebecca B. Morton, Marco Piovesan and Jean-Robert Tyran* We experimentally investigate information aggregation through majority voting when some voters are biased. In such situations, majority voting can have a “dark side,”that is, result in groups making choices inferior to thos...

متن کامل

Ensembles of nearest neighbour classifiers and serial analysis of gene expression

In this paper, we represent experimental results obtained with ensembles of nearest neighbour classifiers on the binary classification problem of cancer classification using serial analysis of gene expression (SAGE) data. Nearest neighbours are selected as classifiers since they were rarely employed in building ensembles because their predictions are stable to small perturbations of data, which...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2011